Read my lips: speech distortions in musical lyrics can be overcome (slightly) by facial information

نویسندگان

  • Dominic W. Massaro
  • Alexandra Jesse
چکیده

Understanding the lyrics of many contemporary songs is difficult, and an earlier study [Hidalgo-Barnes, M., Massaro, D.W., 2007. Read my lips: an animated face helps communicate musical lyrics. Psychomusicology 19, 3–12] showed a benefit for lyrics recognition when seeing a computer-animated talking head (Baldi ) mouthing the lyrics along with hearing the singer. However, the contribution of visual information was relatively small compared to what is usually found for speech. In the current experiments, our goal was to determine why the face appears to contribute less when aligned with sung lyrics than when aligned with normal speech presented in noise. The first experiment compared the contribution of the talking head with the originally sung lyrics versus the case when it was aligned with the Festival text-to-speech synthesis (TtS) spoken at the original duration of the song’s lyrics. A small and similar influence of the face was found in both conditions. In the three experiments, we compared the presence of the face when the durations of the TtS were equated with the duration of the original musical lyrics to the case when the lyrics were read with typical TtS durations and this speech embedded in noise. The results indicated that the unusual temporally distorted durations of musical lyrics decreases the contribution of the visible speech from the face. 2008 Elsevier B.V. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Accepted Manuscript Accepted Manuscript Read My Lips: Speech Distortions in Musical Lyrics Can Be Overcome (slightly) by Facial Information

Understanding the lyrics of many contemporary songs is difficult, and an earlier study (Hidalgo-Barnes and Massaro, 2007) showed a benefit for lyrics recognition when seeing a computer-animated talking head (Baldi®), mouthing the lyrics along with hearing the singer. However, the contribution of visual information was relatively small compared to what is usually found for speech. In the current...

متن کامل

Read my lips: an animated face helps communicate musical lyrics

Understanding the lyrics of many contemporary songs is difficult. Watching the talker’s face improves speech understanding when the speech is degraded by noise or hearing difficulty. To explore whether the face can be similarly helpful in music, 34 phrases from the song “The Pressman” by Primus (1993) were played to thirteen college students. These phrases were aligned with Baldi , a computer-a...

متن کامل

Rhyme and Style Features for Musical Genre Classification by Song Lyrics

How individuals perceive music is influenced by many different factors. The audible part of a piece of music, its sound, does for sure contribute, but is only one aspect to be taken into account. Cultural information influences how we experience music, as does the songs’ text and its sound. Next to symbolic and audio based music information retrieval, which focus on the sound of music, song lyr...

متن کامل

Lyrics Recognition from a Singing Voice Based on Finite State Automaton for Music Information Retrieval

Recently, several music information retrieval (MIR) systems have been developed which retrieve musical pieces by the user’s singing voice. All of these systems use only the melody information for retrieval. Although the lyrics information is useful for retrieval, there have been few attempts to exploit lyrics in the user’s input. In order to develop a MIR system that uses lyrics and melody info...

متن کامل

Searching Lyrical Phrases in A-Capella Turkish Makam Recordings

Search by lyrics, the problem of locating the exact occurrences of a phrase from lyrics in musical audio, is a recently emerging research topic. Unlike key-phrases in speech, lyrical key-phrases have durations that bear important relation to other musical aspects like the structure of a composition. In this work we propose an approach that address the differences of syllable durations, specific...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Speech Communication

دوره 51  شماره 

صفحات  -

تاریخ انتشار 2009